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Abstract 

Typical protocols for peer-to-peer file sharing over the Internet divide files to be shared into pieces. 
New peers strive to obtain a complete collection of pieces from other peers and from a seed. In this 
paper we investigate a problem that can occur if the seeding rate is not large enough. The problem is 
that, even if the statistics of the system are symmetric in the pieces, there can be symmetry breaking, 
with one piece becoming very rare. If peers depart after obtaining a complete collection, they can tend 
to leave before helping other peers receive the rare piece. Assuming that peers arrive with no pieces, 
there is a single seed, random peer contacts are made, random useful pieces are downloaded, and peers 
depart upon receiving the complete file, the system is stable if the seeding rate (in pieces per time unit) 
is greater than the arrival rate, and is unstable if the seeding rate is less than the arrival rate. The result 
persists for any piece selection policy that selects from among useful pieces, such as rarest first, and it 
persists with the use of network coding. 

I. Introduction 

Peer-to-peer (P2P) communication in the Internet is provided through the sharing of widely 
distributed resources typically involving end users' computers acting as both clients and servers. 
In an unstructured peer-to-peer network, such as BitTorrent [|21, a file is divided into many pieces. 
Seeds, which hold all pieces, distribute pieces to peers. New peers continually arrive into the 
network; they simultaneously download pieces from a seed or other peers and upload pieces to 
other peers. Peers exit the system after they collect all pieces. 

Determining whether a given P2P network is stable can be difficult. Roughly speaking, the 
aggregate transfer capacity scales up in proportion to the number of peers in the network, but 
it has to be in the right places. Many P2P systems have performed well in practice, and they 
incorporate a variety of mechanisms to help achieve stability. A broad problem, which we address 
in part, is to provide a better understanding of which mechanisms are the most effective under 
various network settings. These mechanisms include 

• Rarest first piece selection policies, such as the one implemented in BitTorrent, whereby 
peers determine which pieces are rarest among their neighbors and preferentially download 
such pieces. 

• Tit-for-tat participation constraints, such as the one implemented in BitTorrent, whereby 
peers are choked off from receiving pieces from other peers unless they upload pieces to 
those same peers. This mechanism provides an important incentive for peers to participate 
in uploading pieces, but it may also be beneficial in balancing the distribution of pieces. 

• Peers dwelling in the network after completing download, to provide extra upload capacity. 
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• Network coding /[7]/, /El/, whereby data pieces are combined to form coded pieces, giving 
peers numerous ways to collect enough information to recover the original data file. 

This paper determines what parameter values yield stability for a simple model of a P2P file 
sharing network. The main model does not include the enhancements mentioned in the previous 
paragraph, but extensions and discussion regarding the above mechanisms is given. The model 
includes a fixed seed in the network that uploads with a constant rate. New peers arrive according 
to a Poisson process, and have no pieces at the time of arrival. Random peer contact is assumed; 
each peer contacts a randomly selected target peer periodically. Random useful piece selection 
is also assumed; each peer chooses which piece to download uniformly at random from the set 
of pieces that its selected target has and it itself does not have. As in the BitTorrent system, we 
assume that new peers arrive with no pieces; in effect a peer must first obtain a piece from another 
peer or the fixed seed before it can begin uploading to other peers. We also assume that peers 
depart as soon as they have completed their collection. 

In a P2P network, the last few pieces to be downloaded by a peer are often rare in the network, 
so it usually takes the peer a long time to finish downloading. This phenomenon has been referred 
to as the delay in endgame mode [|2l (or last piece problem). We refer to the specific situation 
that there are many peers in the network and most of them are missing only one piece which is 
the same for all peers, as the missing piece syndrome. In that situation, peers lucky enough to get 
the missing piece usually depart immediately after getting the piece, so their ability to spread the 
missing piece is limited. 

The main result in this paper is to show, as suggested by the missing piece syndrome, that the 
bottleneck for stability is the upload capacity of the seed. Specifically, if the arrival rate of new 
peers is greater than the seed upload rate, the number of peers in the system converges to infinity 
almost surely; if the arrival rate of new peers is less than the seed upload rate, the system is 
positive recurrent and the mean number of peers in the system in equilibrium is finite. The next 
section gives the precise problem formulation, simulation results illustrating the missing piece 



syndrome, and the main proposition. The proposition is proved in Sections [111] and |W| with the 
help of some lemmas given in the appendix. Section |V] provides extensions of the result, including 
consideration of the enhancement mechanisms mentioned above. In particular, it is shown that 
the region of network stability is not increased if rarest first piece selection policies, or network 
coding policies, are applied. Section |V] also provides a conjecture regarding a refinement of the 
main proposition for the borderline case when the arrival rate is equal to the seeding rate; it is 
suggested that whether the system is stable then depends on the rate that peers contact each other. 

The model in this paper is similar to the flat case of the open system of Massoulie and Vojnovic 
[0, ifTOl . The model in [|9l, [ITOl is slightly different in that, rather than having a fixed seed, it 
assumes that new peers each arrive with a randomly selected piece. A fluid model, based on the 
theory of density-dependent jump Markov processes (see [7J), is derived and studied in [9l, [fTOll . It 
is shown that there is a finite resting point of the fluid ordinary differential equation. The analysis 
in this paper is different and complementary. Rather than appealing to fluid limits, we focus on 
direct stochastic analysis methods, namely using coupling to prove transience for some parameter 
values and the Foster-Lyapunov stability criterion to prove positive recurrence for complementary 
parameter values. Furthermore, our work shows the importance of considering asymmetric sample 
paths even for symmetric system dynamics. Forthcoming work described in [fTTl provides analysis 
of P2P networks with peers having pieces upon arrival, as in ||9l, lITOl . and with peers remaining 
for some time in the system after obtaining a complete collection. 

Some other works related to stability and the missing piece syndrome are the following. The 
instability phenomenon identified in this paper was discovered independently by Norros et al. 
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|fT3ll . Norros et al. [fT3l proved a version of our main proposition for a similar model, for the case 
of two pieces. In the model of [,13,1 a peer receives one piece on arrival, with the distribution of 
the piece number (either one or two) being determined by sampling uniformly from the group 
consisting of a fixed seed and the population of peers already in the system. 

Menasche et al [fTTII pointed out that in their simulation studies, their "smooth download 
assumption" and "swarm sustainability" break down if the seed upload rate is not sufficiently 
large. Leskela et al. flSl investigate stability conditions for a single piece file, or a two piece 
file when the pieces are obtained sequentially, when peers remain in the system for some time 
after obtaining the piece. The earliest papers to analytically study unstructured peer-to-peer files 
systems with arrivals of new peers are [|14l . [1151 . These papers provide simple models in which a 
two dimensional differential equation is used that does not take into account the stages of service 
as peers gain more pieces. 

IL Model formulation and simulations 

The model in this paper is a composite of models in [|9l, IfTOl , ||T6l . It incorporates Poisson 
arrivals, fixed seed, random uniform contacts, and random useful piece selection, as follows. The 
parameters of the model are an integer K >1 and strictly positive constants A,/i, and Us- 

• There are K pieces and F = {1, . . . , K}, so that F indexes all the pieces. 

• The set of proper subsets of F is denoted by C. 

• A peer with set of pieces c, for some c e C, is called a type c peer. 

• A type c peer becomes a type c U {z} peer if it downloads piece i for some i ^ c. 

• A Markov state is x = (xc : c G C), with Xc denoting the number of type c peers, |x| denoting 
the number of peers in the system, and S = 7J\_ denoting the state space of the system. 

• Peers arrive exogenously one at a time with no pieces; the times of arrival form a rate A 
Poisson process. 

• Each peer contacts other peers, chosen uniformly at random from among all peers, for 
opportunities to download a piece (i.e. pull) from the other peers, according to a Poisson 
process of rate > 0. Mathematically, an equivalent assumption is the following. Each peer 
contacts other peers, chosen uniformly at random from among all peers, for opportunities to 
upload a piece (i.e. push) to the other peers, according to a Poisson process of rate > 0. 

• Downloads are modeled as being instantaneous. This assumption is reasonable in the context 
of the previous assumption. 

• Random useful piece selection is used, meaning that when a peer of type c has an opportunity 
to download a piece from a peer of type s, the opportunity results in no change of state if 
s C c. Otherwise, the type c peer downloads one piece selected at random from s — c, with 
all |s — c| possibilities having equal probability. 

• There is one fixed seed, which at each time in a sequence of times forming a Poisson process 
of rate Us, selects a peer at random and uploads a random useful piece to the selected peer. 

• Peers leave immediately after obtaining a complete collection. 

Given a state x, let To(x) denote the new state resulting from the arrival of a new peer. Given 
cGC,l<i<K such that i ^ c, and a state x such that Xc > 1, let Tci(x) denote the new state 
resulting from a type c peer downloading piece i. The positive entries of the generator matrix 
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Q = (g(x, x') : X, x' G S) of the Markov process are given by: 

g(x,To(x)) = A 



g(x,Tc,i(x)) 



K 



s — c 



s-.ids 

if Xc > and i ^ c. 



To provide some intuition, we present some simulation results. Figure [T] shows simulations of 
the system for [/^ = = 1 and = 40 pieces. The first plot shows apparently stable behavior. 
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Fig. 1. Number of peers vs. time. Tiie first plot is for A = 0.6 (dasiied) and A = 0.8 (solid), and the second is for A = 1.2 
(dashed) and A — 1.4 (solid). 

After an initial spike, the number of peers in the system seems to hover around 30 (for A = 0.6) 
or 45 (for A = 0.8), which by Little's law is consistent with a mean time in system around 50 
to 60 time units (or about 25% to 50% larger than the sum of the download times). However, 
the second plot shows that for A = 1.2 or A = 1.4, the number of peers in the system does not 
appear to stabilize, but rather to grow linearly. The explanation for this instability is indicated in 
Figure |2} which shows the time-averaged number of peers that held each given piece during the 
simulations, for A = 0.6 in the first plot and for A = 1.4 in the second plot. The first plot shows 
that the 40 pieces had nearly equal presence in the peers, with piece 7 being the least represented. 
The second plot shows that 39 pieces had nearly equal presence and most of the peers had these 
pieces most of the time, but only a small number of peers held piece 3. The following proposition, 
which is the main result of this paper, confirms that the intuition behind the simulation results is 
correct. 

Proposition II.l (i) If X > Ug then the Markov process is transient, and the number of peers 
in the system converges to infinity with probability one. ( ii) If X < Ug the Markov process with 
generator Q is positive recurrent, and the equilibrium distribution tt is such that 7r(x)|x| < oo. 

In the remainder of this section, we give an intuitive explanation for the proposition, which also 



guides the proof. We first give an intuitive justification of Proposition II.l i), so assume A > Ug- 
Under this condition, eventually, due to random fluctuations, there will be many peers in the 
system that are all missing the same piece. While any of the K pieces could be the missing one, 
to be definite we focus on the case that the peers are missing piece one. A peer is said to be in the 
one club, or to be a one-club peer, if it has all pieces except piece one. We consider the system 
starting from an initial state in which there are many peers in the system, and all of them are in 
the one club. The system then evolves as shown in Figure |3j The large size of the box showing 
the one club indicates that most peers are one club peers. A peer not in the one club is said to be 
a young peer, and a young peer is said to be normal if it does not have piece one and infected if 
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Fig. 2. Average number of peers holding each piece for the duration of the simulations. The first plot is for A = 0.6 and the 
second is for A — 1.4. The dashed lines indicate time-average number of peers in system. 
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Fig. 3. Flows of peers and pieces. Solid lines indicate flows of peers; dashed lines indicate flows of pieces. 



it does have piece one. Since there are so many one club peers to download from, a peer doesn't 
stay young very long; most of the young peers join the one club soon after arrival. However, 
due to the fixed seed uploading pieces, some of the normal young peers become infected peers. 
Those infected peers can infect yet more young peers, thereby forming a branching process. But 
typically the infected young peers do not infect other young peers, so that the branching process 
is highly subcritical. Therefore, the rate of departures from the one club due to uploads of piece 
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one from infected peers is small. Therefore, most peers eventually enter the one club, and the 
main way that peers leave the one club is to receive piece one directly from the fixed seed. So 
the long term arrival rate at the one club is close to A and the departure rate from the one club is 
close to Us- Therefore, the one club can grow at rate close to A — f/s, while the number of young 
peers will stay about constant. These ideas are made precise in the proof. 

To understand why the system is stable for A < f/^, the rough idea is to show that whenever 
there are many peers in the system, no matter what the distribution of pieces they hold, the system 
moves towards emptying out. If there are many peers in the system, one of the following two 
cases holds. The first case is that most of the peers have the same number, say ko, of pieces. 
Intuitively, the worst case would be for all peers with ko pieces to have identical collections of 
pieces, in which case no peer with ko pieces would be useful to another. However, if A < t/^, 
such a state can't persist, because peers with ko pieces get additional pieces from the fixed seed 
at an aggregate rate near Us-, while the long term rate that new peers with exactly ko pieces can 
appear is less than or equal to A. If the system is not in the first case just described, then there 
are at least two sizeable groups of peers, so that all the peers in the first group have one number 
of pieces and all peers in the second group have some larger number of pieces. Then all peers 
in the second group can be helpful to any peer in the first group, so that there will be a large 
rate of downloads. Thus, if there are many peers in the system, no distribution of the pieces they 
hold can persist. To prove stability, it is still necessary to show that the state can't spiral out to 
ever increasing loads through some quasi-periodic behavior. This is achieved through the use of 
a potential function and the Foster-Lyapunov stability criterion. 



III. Proof of instability if A > f/. 



Proposition 



Proposition II. 



II. l i) is proved in this section; it can be read independently of the proof of 



Jii) in the next section. The proof follows along the lines of the intuitive explanation 
given just after the statement of the proposition in Section [11} and an additional explanation of the 
proof is provided in a remark at the end of the section. Assume A > [7^. If K = 1, the system 
reduces to an M/M/1 queueing system with arrival rate A and departure rate Us, in which case 
the number of peers in the system converges to infinity with probability one. So for the remainder 
of this proof assume > 2. To begin: 
. Select e > so that 3e < A - Us- 
. Select ^ > so that e - AK^Us > 0, and 

p<^ where p = 2^(K - 1). (1) 

It follows from Q that ^ < 0.5. 

• Select eo small enough that < ^. 

• Select B large enough that 



gA[2{i^-l)//x+l]2-B 

1 - 2-^° 
GAK^^Us 
2B{e-AKiUs) 



< 0.1, (2) 

< 0.1, (3) 



^ < 0.1, and — < 0.1. (4) 



Select No large enough that jr-^^ < ^■ 



2Be - 2Be 

B 

No~3B 
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We shall use the notions of one club, young peer, and infected young peer, as described in the 
paragraph after Proposition II. 1 For a given time t >0, define the following random variables: 

• At : cumulative number of arrivals, up to time t 

• Nt : number of peers at time t 

• Yt : number of young peers at time t 

• Dt : cumulative number of uploads of piece one by infected peers, up to time t 

• Zt : cumulative number of uploads of piece one by the fixed seed, up to time t 

The system is modeled by an irreducible, countable- state Markov process. A property of such 
random processes is that either all states are transient, or no state is transient. Therefore, to prove 
Proposition II. 1 i), it is sufficient to prove that some particular state is transient. With that in mind, 
we assume that the initial state is the one with No peers, and all of them are one-club peers. Let r 
be the extended stopping time defined by r = min{t > : > ^A^*}, with the usual convention 
that T = oo if Yt < ^Nt for all t. It suffices to prove that 

P{t = oo and lim Nt = +00} > 0.6. (5) 

The equation (|5]) depends on the transition rates of the system out of states such that Y < ^N. 
Thus, we can and will prove ([5]) instead for an alternative system, that has the same initial state, 
and the same out-going transition rates for all states such that F < ^A^, as the original system. The 
alternative system is defined by modifying the original system by letting the rate of downloads 
from the set of one-club peers by each young peer be /imax{^^^^, |}, and the aggregate rate of 
downloads from the fixed seed to the set of young peers be f/^ min{^, ,^}. Note that the rates 
used for this definition are equal to the original ones on the states such that F < ^A^, as required. 
The alternative system has the following two properties: 

1) Each young peer receives opportunities to download from one-club peers at rate greater than 
or equal to /x/2. 

2) The fixed seed contacts the entire population of young peers at aggregate rate less than or 
equal to ^Us- 

For the remainder of this proof we consider the alternative system, but for brevity of notation, 
use the same notation for it as for the original system, and refer to it as the original system. 

The following four inequalities will be established, for e, ^,eo,-B, and No satisfying the condi- 
tions given near the beginning of the section. 

P{At > -B + {X- e)t for all t > 0} > 0.9 (6) 

P{Zt <B + {Us + e)t for all t > 0} > 0.9 (7) 

P{Yt < B + eot for all t > 0} > 0.9 (8) 

P{Dt <B + et for all t > 0} > 0.9 (9) 

Let S be the intersection of the four events on the left sides of Q-Q. Since A^^ is greater than 
or equal to the number of peers in the system that don't have piece one, on S, 
Nt > No + At- Dt - Zt > No-3B + {\- Us - 3e)t for all t > 0. Therefore, on S, for any 
t > 0, 

Yt ^ B + eot 



Nt No-3B + {X-Us- 3e)t 
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Thus, £^ is a subset of the event in ([5]). Therefore, if ([6])-(|9]) hold, P{S} > 0.6, and (|5]) is implied. 
So to complete the proof, it remains to prove (|6])-(|9]). 

The process A is a Poisson process with rate A, and Z is stochastically dominated by a Poisson 
process with rate Us- Thus, both (|6]) and Q follow from Kingman's moment bound (see Lemma 
VI. 2| in the appendix) and the conditions in Q on 5. 



Turning next to the proof of ([8]), we shall use the following observation about stochastic 
domination (the notion of stochastic domination is reviewed in the appendix). The observation is 
a mathematical version of the statement that the number of young peers remains roughly bounded 
because peers don't stay young for long. 

Lemma III.l The process Y is stochastically dominated by the number of customers in an 
M/GI/oo queueing system with initial state zero, arrival rate A, and service times having the 
Gamma distribution with parameters K — 1 and ^/2. 

Proof: The idea of the proof is to show how, with a possible enlargement of the underlying 
probability space, an M/GI/oo system can be constructed on the same probability space as the 
original system, so that for any time t, Yt is less than or equal to the number of peers in the 
M/GI/oo system. Let the M/GI/oo system have the same arrival process as the original system- 
it is a Poisson process of rate A. For any young peer, the intensity of downloads from the one 
club (i.e. from any peer in the one club) is always greater than or equal to yu/2 for the original 
system, where we use the fact 1 — > 1/2, which is true by ([T]) and the assumption K > 2. 
We can thus suppose that each young peer has an internal Poisson clock, which generates ticks 
at rate fx/2, and is such that whenever the internal clock of a young peer ticks, that young peer 
downloads a piece from the one club. We declare that a peer remains in the M/GI/oo system 
until its internal clock ticks K — 1 times. This gives the correct service time distribution, and the 
service times of different peers in the M/GI/oo system are independent, as required. A young 
peer can possibly leave the original system sooner than it leaves the M / GI /oo system, because a 
young peer in the original system can possibly download pieces at times when its internal clock 
doesn't tick. But if a young peer is still in the original system, it is in the M/GI/oo system. 



Given this lemma, ([8]) follows from Lemma VL4 with m in the lemma equal to 2{K — l)/fi 



and e in the lemma equal to e„, and ([2]). It remains to prove (|9]). 

Consider the following construction of a stochastic system that is similar to the original one, 
with random variables that have similar interpretations, but with different joint distributions. We 
call it the comparison system. It focuses on the infected peers and the uploads by infected peers, 
and it is specified in Table |lj 

It should be clear to the reader that both the original system and the comparison system can be 
constructed on the same underlying probability space such that any infected peer in the original 
system at a given time is also in the comparison system. When such a peer becomes infected in 
the original system, we require that it also arrives to the comparison system, it discards all pieces 
it may have downloaded before becoming infected, and it subsequently ignores all opportunities to 
download except those occurring at the times its internal clock (described in the proof of Lemma 



III.l) ticks. Because infected young peers possibly stay longer in the comparison system than in 
the original system, some of the peers in the comparison system correspond to peers that already 
departed from the original system. There can also be some infected peers in the comparison 
system that never existed in the original system because the arrival rate of infected peers to the 
comparison system is greater than the arrival rate for the original system. But whenever there is an 
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TABLE I 

Specification of comparison system 



Original system 


Comparison system 


The fixed seed creates infected peers at a 
rate less than ^Us- 


The fixed seed creates infected peers at rate 


An infected peer creates new infected peers 
at a rate less than ^/i. 


An infected peer creates new infected peers 
at rate 


An infected peer uploads piece one to one- 
club peers at a rate less than or equal to 
fi. 


An infected peer uploads piece one to one- 
club peers at rate fi. 


Just after a peer becomes infected, it re- 
quires at most K — 1 additional pieces, and 
the rate for acquiring those pieces is greater 
than or equal to /^/2. 


After a new infected peer arrives, it must 
download K ~ 1 additional pieces, and the 
rate for acquiring those pieces is /i/2. 



infected peer in the original system, that peer is also in the comparison system, and the following 
property holds. Whenever any one of the following events happens in the original system, it also 
happens in the comparison system: 

• The fixed seed creates an infected peer. 

• An infected peer creates an infected peer 

• An infected peer uploads piece one to a one-club peer 

Events of the second and third type just listed correspond to the two possible ways that infected 
peers can upload piece one. Therefore, the property implies the following lemma, where D is the 
cumulative number of uploads of piece one by infected peers, up to time t, in the comparison 
system. 

Lemma III.2 The process {Dt : t > 0) is stochastically dominated by {Dt : t > 0). 

We can identify two kinds of infected peers in the comparison system-the root peers, which 
are those created by the fixed seed, and the infected peers created by other infected peers. We can 
imagine that each root peer affixes its unique signature on the copy of piece one that it receives 
from the fixed seed. The signature is inherited by all copies of piece one subsequently generated 
from that piece through all generations of the replication process, in which infected peers upload 
piece one when creating new infected peers. In this way, any upload of piece one by an infected 
peer can be traced back to a unique root peer. In summary, the jumps of D can be partitioned 
according to which root peer generated them. Of course, the jumps of D associated with a root 

peer happen after the root peer arrives. Let {Dt : t >0) denote a new process which results when 
all of the uploads of piece one generated by a root peer (in the comparison system) are counted 

at the arrival time of the root peer. Since D counts t he sa me events as D, but does so earlier, 

Dt < Dt for all t > 0. In view of this and Lemma III. 2 it is sufficient to prove (|9]) with D 

replaced by D. 

The random process D is a compound Poisson process. Jumps occur at the arrival times of 
root peers in the comparison system, which form a Poisson process of rate ^Us- Let J denote the 

size of the jump of D associated with a typical root peer. The distribution of J can be described 
by referring to an M/GI/1 queueing system with arrival rate ^/i and service times having the 
distribution of a random variable X which has the Gamma distribution with parameters K — 1 and 
/i/2. Note that p in ([1]) is the usual load factor for the reference queueing system: p = ^pE[X]. 
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The reference queueing system is similar to the number of infected peers in the comparison 
system, except that the customers in the M/GI /I queueing system are served one at a time. We 
have J = Ji + J2, where 

• Ji is the number of infected peers that are descendants of the root peer (not counting the 
root peer itself.) That includes peers directly created by the root peer, peers created by peers 
created by the root peer, and so on, for all generations. Ji has the same distribution as the 
number of customers in a busy period of the reference queueing system, not counting the 
customer that started the busy period. 

• J2 is the number of uploads of piece one to one-club peers by either the root peer or any 
of the descendants of the root peer. The sum of all the times that the root peer and its 
descendants are in the comparison system is the same as the duration, L, of a busy period 
of the reference queueing system. While in the comparison system, those peers upload piece 
one to the one club with intensity ^. So E[J2] = ^iE[L\ and E[Jl] = ^^E[LY + ^E[L\. 

Using this stochastic description, the formulas for the busy period in an M/GJ/l queueing system 
(([18]) and ^ in the appendix), and the facts p < 1/2, E[X] = 2{K - l)//i, and Var(X) = 
(ir-l)(2//i)2, yields 

E[J] = i?[J,]+E[J,] = i±^-l 
< 2[1 + 2{K - 1)] < AK 

and 

^[j,^]<i.p, + i)^]^i±ip!^< 



(i_p)3 -{i-pY 



E[J^] = E[E[Jl\L]]=pE[L]+p'E[L^] 
pE[X] p^E[X^] 

= E[(Ji + J2)2] <2{E[J2]+E[J2]} 

< 16{2 + pE[X]+ p^E[X^]} 

= 16 {2 + 2{K - 1) + A{K - 1) + 4{K - 1)^} 

= 16 {AK^ - 2K] < 6AK^ 

Thus, D is a compound Poisson process with arrival rate of batches equal to ^Us and batch sizes 
with first and second moments of the batch sizes bounded by AK and 64K^ respectively. Hence, 

(|9]) with D replaced by D follows from Corollary 
is complete. 

Remark III.3 We briefly explain why the comparison system was introduced in the above proof, 
to provide a better understanding of the proof technique. The intuitive idea behind the deflnition 
of the comparison system is that it is based on worst case assumptions regarding the number of 
peers that are infected by the flxed seed (i.e. the number of root peers) and the number of uploads 
of piece one that can be caused by each root peer. The advantage is then that the arrivals of root 



VI.3 



and (|3]). The proof of Proposition 



11.11) 
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peers form a Poisson process and the total number of uploads of piece one that can be traced 
back to different root peers are independent in the comparison system, so that Kingman 's bound 
for compound Poisson processes, which is a form of the law of large numbers, can be applied. 



IV. Proof of stability if A < f/. 



Proposition 11. 1 ii) is proved in this section, using the version of the Foster-Lyapunov stability 
criterion given in the appendix, and the intuition given in the last paragraph of Section |ll} 

If V is a function on the state space S, then QV is the corresponding drift function, defined 
by QV{x.) = Xly y^x9(^'y)[^(y) ~ ^(^)]- If' usual, the diagonal entries of Q are defined 
to make the row sums zero, then the drift function is also given by matrix-vector multiplication: 

Wx) = Ey?(x,y)ny). 

Suppose A < Us- Given a state x, let rij(x) = J2cec-\c\=i^c- That is, nj(x) is the number of 
peers with precisely i pieces. When the dependence on x is clear, we write Ui instead of nj(x). We 
shall use the Foster-Lyapunov criteria with the following potential function: V"(x) = J2f=o^ h^ii'^) 
where bo, ■ ■ ■ ,&a'-i are positive constants and $i(x) = ("0+"^+"') _ 

Let -Di(x) denote the sum, over all n,; peers with i pieces, of the download rates of those peers. 
Since any peer with i + 1 or more pieces always has a useful piece for a peer with i pieces, it 
follows that -Di(x) > (ii(x), where 



dAx) 



rii (^Us + fiYl 



j=i+l 



(10) 



We shall write di instead of (ij(x). We have 



g$.(x) 



< 



A [{uq H h rii + 1)2 - (no H V rii 



di [{riQ 



rii - r 



- + 
+ rii 



(A - di) [no + 



< A 



riQ 



Hi + 



+ ni\ + 
1 



2 

\ + di 



rii 



di 



Since QV = Y^fjo^ hQ^i it follows that 

Qnx)<^+fAx; 



(^i - hdi 



(11) 



where = hi + ■ ■ ■ + fox-i for 0<z<iC — 1. In what follows, assume that the constants 
60, • • • , &n are chosen so that 1 = hx^i < bK-2 < ■ ■ ■ < &i < &o and 

A 



h > 



Oi+i for <i < K - 2. 



(12) 



Since Oj+i = aj — 6j, ( 12) is equivalent to 



U,bi - Aa, > for < z < is: - 2. 



(13) 
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The following two lemmas and their proofs correspond to the two cases described in the intuitive 
description given in the last paragraph of Section |Ilj 

Lemma IV.l There exist positive values r],e, and L so that Ql^(x) < — e|x| whenever: |x| > L 
and, for some i, rii > (1 — ^7)1x1. 



Lemma IV.l Let r] be as in Lemma IV.l There exist positive values e' and L' so that QV{x.) < 
— e'|x| whenever: |x| > L' and, for all i, rii < (1 — r])\x.\. 



Lemmas IV.l and IV2 imply that QV{x.) < — min{e', e}|x| whenever |x| > max{L, L'}, so 
that Q and V satisfy the conditions of Proposition VL6 with /(x) = min{e',e}|x| and ^'(x) = 
i/ii where B = max{QV(x) : |x| < max{L, L'}}. Therefore, to complete the proof 



Bl 



{|x||<max{L,L'}} 



of Proposition II. 1 li) it remains to prove Lemmas IV.l and IV.2 



Proof: (Proof of Lemma IV.l ) It suffices to prove the lemma for an arbitrary choice of z. So 
fix i G {0, 1, 2, ...K—1}, and consider a state x such that nj/|x| > 1—r] (and, in particular, Ui > 1). 
Then for any j ^ i, nj/rii = (r;,j/|x|)(|x|/r;,j) < Use ([10]) and ( [TT] ) and an interchange of 

= Ef=V EU) to get 

Qy(x) 



summation (J2f=o^ J2f=^l 



< 



< 




Ui - -] hidi 



rii 



ai 1 -r] 
ni (Us + At E, 



K-l 
j=i+l 



< 



QqA 



1^ V tti l-r]J 



hjUs 
21x1 



(14) 



Notice that according to ([13]), 



lim <; ttj I 1 + 

r)-s>0 



KaQ rj 



and 



lim 

|x|— >CXD 2 X 



X-h{l-r^)U, 
= QiX — hiUs < 

0. 



Thus, if 7] is small enough and |x| is large enough, the quantity within braces in ( [14] ) is negative. 
Therefore, if rj and e are small enough, and L is large enough, 

^.r, X / aQ\ + ni{ai\-hiUs} ^ . . 
QV['x.) < < — e|x| 



under the conditions of the lemma, whenever |x| > L. Lemma IV.l is proved. 
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Proof: (Proof of Lemma IV. 2 ) Let 77 be given by Lemma IV. 1 and consider a state x such 
that nj/|x| < 1 — ?7 for all i. It follows that there exists ii and ^2 with Q<ii<i2<K~l such 
that rii^ > ^ and rii^ > Then 



QV^(x 

< 



< 



< 



an A 
— + 
2 


X oo-ft'A 


On A 

— + 

2 


X ao-ft'A 


- [nil 




OqA 


X ao-ft'A 



OqA 



aoKX 





X 


?7 X 


1 


K 


~ 2 




[- 


2 





2 I 
7] |X| 

1^^ 



7^ 



K 



) |x| 



(15) 



The conclusion of the lemma follows because of the term in (15) that is quadratic in |x| 



V. Generalization and Discussion 

A. General Piece Selection Policies 

A piece selection policy is used by a peer to choose which piece to download whenever it 
contacts another peer. The random useful piece selection policy is assumed above, but the results 
extend to a large class of piece selection policies. Essentially the only restriction needed is that 
if the contacted peer has a useful piece for the contacting peer, then a useful piece must be 
downloaded. This restriction is similar to a work conserving restriction in the theory of service 
systems. In particular, the results hold for a broad class of rarest first piece selection policies. Peers 
can estimate which pieces are more rare in a distributed way, by exchanging information with the 
peers they contact. Even more general policies would allow the piece selection to depend in an 
arbitrary way on the piece collections of all peers. Interestingly enough, the results extend even 
to seemingly bad piece selection policies. For example, it includes the sequential piece selection 
policy, in which peers obtain the pieces in order, beginning with piece one. The sequential policy 
can be viewed as a most abundant first useful piece selection policy, or just the opposite of rarest 
piece first. 

To be specific, consider the following family "H of piece selection policies. Each policy in "H 
corresponds to a mapping h from C x (C U {J^}) x 5 to the set of probability distributions on J^, 
satisfying the usefulness constraint: 

5,x) = l whenever 5 ^ A 

i&B-A 

with the following meaning of h: 

• When a type A peer selects a piece to download from a type B peer and the state of the 
entire network is x, piece i is selected with probability hi(A, B,x.), for i E J'. 

• When the fixed seed selects a piece to upload to a type A peer and the state of the entire 
network is x, piece i is selected with probability hi{A, for i E 

The piece selection policies noted above are included in T-L. 
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Reconsider the proof of transience in Section III under a piece selection policy in l-L. From 
any state it is possible to reach the empty state, and from the empty state it is possible to reach 
a state with one peer in the network having all pieces except some piece i^. From that state, for 
any No > 1, it is possible to reach the state with No peers missing only piece io, and no other 
peers in the network. It may be impossible for to equal one, but by renumbering the pieces 
if necessary, it can be assumed without loss of generality that zq is one. Thus, whatever piece 
selection policy in l-L is applied, beginning from any initial state, for any A^o > 1, in a finite time 
with a positive probability, the system can arrive into the state where there are No peers and all 
of them are one-club peers. Thus, as in Section [111} to prove transience it suffices to show that 
from such an initial state, there is a positive probability that the number of peers converges to 
infinity. The arrival rate of new peers and the upload rate of the seed does not depend on the 
piece selection policy, so (|6]) and Q are valid for any piece selection policies in l-L. Moreover, 



Lemma III.l and Lemma III.2 are valid for any piece selection policies in H because the two 
lemmas depend on the properties that peer selection is uniformly random and the piece selection 
is useful if a useful piece is available. Therefore ([8]) and (|9]) are also valid for any piece selection 



policy in H. Thus, we conclude that the proof of Proposition II. 1 1) in Section III works for any 
piece selection policy in H. 

Reconsider next the proof of positive recurrence in Section llVl but for an arbitrary piece 



selection policy in H. The inequalities developed for the proofs of Lemmas IV. 1 and IV.2 hold 
with the same Lyapunov function; useful piece selection suffices. Thus, if A < f/^, it can be 
shown that the Lyapunov stability condition, namely QV{yi) < — e|x|, for |x| sufficiently large, 
still holds. The final conclusion has to be modified, however, because under some policies in 
"H, the Markov process might no longer be irreducible. For example, with the sequential useful 
piece selection policy, the set of states such that every peer holds a set of pieces of the form 
{1,2,..., J} for some J with < J < — 1, is a closed subset of states, in the terminology 
of classification of states of discrete- state Markov processes. In general, the set of all states that 
are reachable from the empty state is the unique minimal closed set of states, and the process 
restricted to that set of states is irreducible. By a minor variation of the Foster-Lyapunov stability 
proposition, the Lyapunov stability condition implies that the Markov process restricted to that 
closed set of states is positive recurrent, and the mean time to reach the empty state beginning 
from an arbitrary initial state is finite. 

We summarize the discussion of the previous two paragraphs as a proposition. 

Proposition V.l (Stability conditions for general useful piece selection policies) Suppose a useful 
piece selection policy from H is used, for a network with random peer contacts and parameters 
K, X, Us , and ^ as in Section |^ There is a single class of closed states containing the empty 
state, and all other states are transient, (i) If X > Ug then the Markov process is transient, and 
the number of peers in the system converges to infinity with probability one. ( ii) If X < Us the 
Markov process with generator Q restricted to the closed set of states is positive recurrent, the 
mean time to reach the empty state from any initial state has finite mean, and the equilibrium 
distribution n is such that X]x^(^)l^l < 

Thus, with the exception of the borderline case X = n, rarest first piece selection does not increase 
the region of stability, nor does most abundant first piece selection decrease the region of stability. 

B. Network Coding 

Network coding, introduced by Ahlswede, Cai, and Yeung, fTj, can be naturally incorporated 
into P2P distribution networks, as noted in The related work [3j considers all to all exchange 
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of pieces among a fixed population of peers through random contacts and network coding. The 
method can be described as follows. The file to be transmitted is divided into K data pieces, 
mi, 1712, . . . , rriK- The data pieces are taken to be vectors of some fixed length r over a finite field 
¥g with q elements, where q is some power of a prime number. If the piece size is M bits, this 
can be done by viewing each message as an r = [M/log2(g)] dimensional vector over F^. Any 
coded piece e is a linear combination of the original K data pieces: e = J2f=i ^i'^u the vector 
of coefficients {6i, ... ,6k) is called the coding vector of the coded piece; the coding vector is 
included whenever a coded piece is sent. The fixed seed uploads coded pieces to peers, and peers 
exchange coded pieces. In this context, the type of a peer A is the subspace Va of spanned 
by the coding vectors of the coded pieces it has received. Once the dimension of Va reaches K, 
peer A can recover the original message. 

When peer A contacts peer B, suppose peer B sends peer A a random linear combination 
of its coded pieces, where the coefficients are independent and uniformly distributed over Fg. 
Equivalently, the coding vector of the coded piece sent from B is uniformly distributed over Vb- 
The coded piece is considered useful to A if adding it to A's collection of coded pieces increases 
the dimension of Va. Equivalently, the piece from B is useful to A if its coding vector is not in 
the subspace Va^iVb. The probability the piece is useful to A is therefore given by 

P{piece is useful} = = i _ g*m(y^nys)-dim(Vs)^ 

\Vb\ 

If peer B can possibly help peer A, meaning Vb <f- Va (true, for example, if dim{VB) > dim{VA)), 
the probability that a random coded piece from B is helpful to A is greater than or equal to 1 — ^• 
The probability a random coded piece from the seed is useful to a peer A with dim{VA) = K — 1 
is precisely 1 — ^- Therefore, when all peers have the same state and the common state has 

dimension K — 1, the departure rate from the network is t/^ = f/s(l — ^). 

The network state x specifies the number of peers in the network of each type. There are only 
finitely many types, so the overall state space is still countably infinite. Moreover, the Markov 
process is easily seen to be irreducible. 



Reconsider the proof of transience in Section III but now under network coding. Fix any 
subspace V^ of F^ with dimension K — 1. Call a peer a one-club peer if its state is V^ . For any 
No > 1, it is possible to reach the state with No one-club peers and no other peers in the network. 
As before, call a peer a young peer if it is not a one-club peer. In the case of network coding, 
call a peer infected if its state is not a subspace of V~. The only way a peer can become infected 
is by downloading a piece either from the seed or from an infected peer. Lemmas [6] and [ 7] are 
valid for network coding, if the condition A > f/^ is replaced by A > f/^. Moreover, Lemma 



m.i 



and Lemma III. 2 are valid for network coding because the two lemmas depend on the properties 



that peer selection is uniformly random and the rate useful pieces are deli vered by the seed to 



one-club peers is arbitrarily close to Ug. Thus, we conclude that Proposition II. 1 1) in Section 
with Us replaced by Ug, extends to the case of network coding. 



Ill 



Reconsider the proof of positive recurrence in Section |IV| but^ith random useful piece selection 
replaced by network coding as described, and Us replaced by Ug = f/s(l — ^). Suppose the same 
Lyapunov function is used, except the new meaning of ^^(x), or rii for short, is the number of 



peers A with dim{VA) = i- Lemmas |IV. 1| and |IV.2| are valid for network coding, if the condition 
\ < Ug is replaced by A 
condition, namely QV(x) 
stability criterion applies. 



X < Ug is replaced hy X < Ug. Thus, if A < f/s, it can be shown that the Lyapunov stability 
condition, namely QV(x) < — e|x|, for |x| sufficiently large, still holds, and the Foster- Lyapunov 
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We summarize the discussion of the previous two paragraphs as a proposition. 



Proposition V.2 (Stability conditions for network coding based system) Suppose random linear 
network coding with vectors over is used, with random peer contacts and parameters K, \, Us, 
and /i as in Section^ (i) If X > Us{l — ^) then the Markov process is transient, and the number 
of peers in the system converges to infinity with probability one. (ii) If \ < Us{l — -) the Markov 
process is positive recurrent, and the equilibrium distribution n is such that X]x^^^)l^l ^ 

Thus, as g — i- oo, the stability region for the system with network coding converges to that for 
useful piece selection. Network coding has the advantage that no exchange of state information 
among peers is needed because there is no need to identify useful pieces. 



C. Peer Seeds 

In many unstructured peer-to-peer systems, such as BitTorrent, peers often dwell in the network 
awhile after they have collected all the pieces. In effect, these peers temporarily become seeds, 
called peer seeds. The uploading provided by peer seeds is able to mitigate the missing piece 
syndrome and enlarge the stability region. Intuitively, if every peer can upload, on average, just 
one more piece after collecting all pieces, then every peer can help one one-club peer to depart, 
so the missing piece syndrome would not persist. This is explored for the case of K = 1 and 
K = 2 (for the sequential piece selection policy) in [8J and for random useful piece selection 
with arbitrary K >1 in [.17J . 



D. Peer Selection and Tit-for-Tat 

Another way to overcome the missing piece syndrome relies on peer selection policies. For 
instance, if young peers contact infected peers preferentially, or if the seed uploads to young 
peers preferentially, the network can be stabilized by the resulting increase in the number of 
infected peers. So some sort of coordination policy, providing the identification of rare pieces 
and young peers, and the transmission of the rare pieces to the young peers, can counter the 
missing piece syndrome. A mechanism built into BitTorrent, called tit-for-tat operation, may alter 
the peer selection policy enough to yield stability for any choice of A, fi, and Ug- Under tit-for-tat 
operation, peers upload almost exclusively to peers from which they can simultaneously download. 
An obvious benefit of tit-for-tat is to give peers incentive to upload, thereby helping other peers, 
but it also may be effective against the missing piece syndrome. Specifically, tit-for-tat encourages 
one-club peers to reduce their rate of download to the young peers, because the young peers have 
nothing to upload to the one-club members. This increases the amount of time that peers remain 
young, giving them a greater chance to obtain a rare piece from the fixed seed. Also, infected 
peers would preferentially send to young peers, because often a normal young peer and an infected 
young peer would be able to help each other. While it is thus clear that tit-for-tat operation helps 
combat the missing piece syndrome, we leave open the problem of quantifying the effect for a 
specific model. 



E. The Borderline of Stability 

We have shown that, for any /i > 0, the system is stable if X < Us and unstable if X > Us, and 
this result is insensitive to the value of fi and to the piece selection policy, as long as a useful 
piece is selected whenvever possible. While it may not be interesting from a practical point of 
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Fig. 4. Transition rates for fi — <x system for if = 3. 



view, we comment on the case \ = Us- First, we give a precise result for a limiting case of the 
original system, and then we offer a conjecture. 

A simpler network model results by taking a limit as yU — )■ oo. Call a state slow if all peers in 
the system have the same type, which includes the state such that there are no peers in the system. 
Otherwise, call a state fast. The total rate of transition out of any slow state does not depend on /i, 
and the total rate out of any fast state is greater than or equal to n/2. For very large values of n, 
the process spends most of its time in slow states. The original Markov process can be transformed 
into a new one by watching the original process while it is in the set of slow states. This means 
removing the portions of each sample path during which the process is in fast states, and time- 
shifting the remaining parts of the sample path to leave no gaps in time. The limiting Markov 
process, which we call the /i = oo process, is the weak limit (defined as usual for probability 
measures on the space of cadlag sample paths equipped with the Skorohod topology) of the original 
process watched in the set of slow states, as /i — )• oo. By symmetry of the model, the state space 
of the fi = oo process can be reduced further, to 5 = {(0, 0)} U {(n, /c) : n > 1, 1 < k < K — 1}, 
where a state (n, k) corresponds to n peers in the system which all possess the same set of k 
pieces. The positive transition rates of the /x = oo process are given by: 

transition rate condition 

(n, /c) -> (n + 1, fc) A {n,k)eS 

{n,k) ^ {n,k + l) n>l,0<k<K~2 

(n,K -1) ^ (n-l,K -1) [/, n>2,k = K -1 



{1,K-1)^ (0,0) Us 



and the transition rate diagram is pictured in Figure]?] for K = 3. The top layer of states consists 
of those for which all peers have K — 1 pieces. These states correspond to all peers being in 
the one club, or all missing some other piece. From any state the process reaches the top layer 
in mean time less than or equal to ^ + ^j^, and within the top layer the process behaves like 
a birth-death process with birth rate A and death rate Ug- Since such birth-death processes are 
null-recurrent if A = f/^, it follows that the /i = oo model is null-recurrent if X = Ug. 

Consider the original process with \ = Us and finite /i. Suppose the process is in a state with 
a very large one club which includes all or nearly all the peers; let n be the number of peers in 
the one club. New young peers arrive at rate A, and they are in the system for approximately ^ 
time units while they are holding exactly k pieces for < k < K — 2. . Thus, over the short term, 
the mean number of young peers in the system holding k pieces is near ^ for < k < K — 2. 
The average fraction of peers that are young peers holding k pieces is thus approximately ^ 
for < A; < K — 2. The average total rate that young peers holding k pieces become infected 
is dominated by the rate the fixed seed downloads piece one to them and is thus approximately 
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(K^''k)niJ. ' where the factor -jJ^^ comes the assumption of uniform random useful piece selection 
for downloads from the seed. A young peer that becomes infected when it has k pieces will 
eventually release, on average, about K — k — 1 other peers from the one club. Thus, to a first 
order approximation, for large n, the number of peers in the system behaves like a birth-death 
process with arrival rate A and state dependent departure rate f/s(l + where 

""-'K-k-l 



K -k 

k=0 

The elementary theory of birth-death processes shows that a birth-death process with constant birth 
rate A and state-dependent death rate A(l + ^) is positive recurrent if c > 1 and null-recurrent if 
< c < 1. This strongly suggests the following to be true: 

Conjecture V.l If \ = Us, the process is positive recurrent if < fi < fio (^nd is null recurrent 
if fi> /io- 

We also expect similar results to be true for other piece selection policies, but the value of (Xo 
would depend on the piece selection policy. 

VL Appendix 

A. Stochastic comparison 

A continuous-time random process is said to be cadlag if, with the possible exception of a set 
of probability zero, the sample paths of the process are right continuous and have finite left limits. 

Definition VI.l Suppose A = {At : t > 0) and B = {Bt : t > 0) are two random processes, 
either both discrete-time random processes, or both continuous time, cadlag random processes. 
Then A is stochastically dominated by B if there is a single probability space (f^, J^, P), and two 
random processes A and B on {Q,J^,P), such that 

(a) A and A have the same finite dimensional distributions, 

(b) B and B have the same finite dimensional distributions, and 

(c) P{At < Btfor all t} = 1. 

Clearly if A is stochastically dominated by B, then for any a and t, P{At > a} < P{Bt > a}. 

B. Appendix: Kingman 's Moment bound for SII processes 

Let (Xf : t > 0) be a random process with stationary, independent increments with Xq = 0. 
Suppose the sample paths are cadlag (i.e. right-continuous with finite left limits). Suppose 
is finite, so there are finite constants n and such that E[Xt] = fit and Var(Xt) = a'^t for all 
t > 0. Let X* = supt>oXi. 

Lemma VI.2 (Kingman's moment bound jj^ extended to continuous time) Suppose that ^ < 0. 
Then E[X*] < Also, for any B > 0, P{X* > B} < ^f^. 

Proof: For each integer n > 0, let 5" denote the random walk process 5*^ = Xk2-n. Let 
5*"* = sup^^Q Sk- By Kingman's moment bound for discrete time processes, 

Var(5n 



-2E[S^] -2/i 
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Since 5"* is nondecreasing in n and converges a.s. to X*, the first conclusion of the lemma 
follows. The second conclusion follows from the first by Markov's inequality. ■ 

Corollary VI.3 Let C be a compound Poisson process with Co = 0, with jump times given by a 
Poisson process of rate a, and jump sizes having mean mi and mean square value m2. Then for 
all B > and e > ami 

P{Ct <B + etfor all t}>l- J^"^^ (16) 

lB[e — ami) 



Proof: Let Xt = Ct — tt. Then X satisfies the hypotheses of Lemma VL2 with /x = ami 
and = am2. So P{X* > B} < ^^^^ implies ([16]). 



C. A maximal bound for an ISA/ GI / oo queue 

Lemma VI.4 Let M denote the number of customers in an M / GI /oo queueing system, with 
arrival rate A and mean service time m. Suppose that Mq = 0. Then for B,e > 0, 



gA{m+l)2--B 



P{Mt >B + et for some t > 0} < — — (17) 

Proof: Our idea is to find another M / GI /oo system whose number of customers sampled 
at integer times can be used to bound M. Suppose we let every customer for the original process 
stay in the system for one extra unit time after they have been served. Let Af/ be the number 
of customers in this new M / GI /oo system at time t. Note that is also the number in an 
M / GI /oo system, with arrival rate A and mean service time m + 1. By a well-known property of 
M / GI /oo systems, for any time t, iVlf is a Poisson random variable. Since the initial state is zero, 
the mean number in the system at any time t is less than A(m + 1), which is the mean number 
in the system in equilibrium. If Poi(jf) represents a Poisson random variable with mean fi, then 
the Chernoff inequality yields P{Poi(n) > a} < exp(yLi(e^ — 1) —9a),, and taking 6* = ln2 yields 
P{Poi{fi) >a}< e^2~". For any integer i > 1, if t e [i - 1], then Mt < M^{i). Therefore, 

P{Mt > B + et for some t > 0} 

oo 

< P{Mt >B + et for some te{i-l,i]} 

i=l 

oo 



< ^P{M«>i? + e(^-l)} 



i=l 
oo 



< ^ gA(m+l)2-(i?+e(i-l)) 



i=l 



gA(m+l)2--B 

1 -2-^ 
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D. On Busy Periods for M/GI/1 Queues 

Consider an M/GI/1 queue with arrival rate A. Let denote the number of customers served 
in a busy period, let L denote the length of a busy period, and let X denote the service time of 
a typical customer. 

Lemma VL5 Let p = XE[X]. It p < 1 then 

C-(iV.L) = ^ (20) 

The lemma can be proved by the well-known branching process method. Let X denote the service 
time of a customer starting a new busy period. Let Y denote the number of arrivals while the 
first customer is being served. Then, given X = x, the conditional distribution of Y is Poisson 
with mean Ax. View any customer in the busy period that arrives after the first customer, to 
be the offspring of the customer in the server at the time of arrival. This gives the well known 
representation for and L: 

Y 

N = 1 + 

i=l 
Y 



L = X + ^Li 



where {Ni, Li),i > 1 is a sequence of independent random 2-vectors such that for each i, {Ni, L/) 
has the same distribution as {N,L). Using Wald's identity, these equations can be used to prove 
the lemma. 



E. Foster-Lyapunov stability criterion 

Proposition VI.6 Combined Foster-Lyapunov stability criterion and moment bound-continuous 
time (See /El/, /[22]/-j Suppose X is a continuous-time, irreducible Markov process on a countable 
state space S with generator matrix Q. Suppose V, f, and g are nonnegative functions on S 
such that QVix) < — /(x) + (yf(x) for all x G 5, and, for some 6 > 0, the set C defined by 
C = {x : /(x) < (7(x) + 5} is finite. Suppose also that {x : l^(x) < K} is finite for all K. Then X 
is positive recurrent and, ifrr denotes the equilibrium distribution, /(x)7r(x) < Xlx5'(^)^(^)- 
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